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Various best-choice problems related to the planar homogeneous Poisson process in finite 
or semi-infinite rectangle are studied. The analysis is largely based on properties of the one- 
dimensional box-area process associated with the sequence of records. We prove a series of 
distributional identities involving exponential and uniform random variables, and resolve the 
Petruccelli-Porosinski-Samuels paradox on coincidence of asymptotic values in certain discrete- 
time optimal stopping problems. 

1 Introduction 

On a recent conference on optimal stopping Steve Samuels reported a remarkable coincidence 
of the asymptotic values in two quite different best-choice problems ||2^ - 



Let (Xj) be a sequence of independent uniform [0, 1] random variables. Let 7^ be the family 
of all stopping times t < n adapted to the natural filtration of the sequence, and Tin be the 
subclass of stopping times adapted to the sequence {Sj ,Tj) where 

Sj = max (Xi, . . . , Xj) - min (Xi, . . . , Xj) and T,- = l{x,=max(Xi,...,x,)} 

are the range and the indicator of an upper record at index j, respectively. For N uniformly 
distributed on {1, . . . ,n} and independent of (Xj) define 

Un = sup P{Xr = max (Xi, . . . , Xat), t < N) 

and define another stopping value 

Wn = sup P{Xr = max (Xi, . . . , X„)). 

Tin 

Then, as pointed out by Samuels, the limits are the same 
(1) lim Un = lim Wn 



n— >oo 



and coincide with the value vp := liuin^ooWn established by Petruccelli ||21[ . 

The values m„ and w„ arise in best-choice models representing very different informational 
situations of the observer. The value Un is the optimum probability of stopping at the maximum 
of a sequence sampled from known probability distribution, when the observer has incomplete 



information about the length of the sequence X, see Porosinski pSf. The value Wn appears as 
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the minimax probability of stopping at the maximum of a random sequence with definite length 
n, but only partial information about the distribution of observations: the observer knows that 
the underlying distribution is uniform on a unit interval but is ignorant of the position of the 
interval, see Petruccelli |2T|. 



It had been noticed by Porosinski that the two problems have optimal policies with the 
same collection of thresholds and that numerical values of m„ suggest unmistakable convergence 
to vp. However, the concidence of policies does not imply coincidence of stopping values, as is 
seen from the numerical values tabulated in and in an unpublished Petruccelli's thesis. In 
2^1 Porosinski gave a false argument for (|I]) which, however, involved a computation with the 



right answer. 

Both models are offsprings of the basic problem introduced by Gilbert and Mosteller as 
the 'full-information game' [|12[, where the objective is to maximise P{Xr = max (Xi, . . . , X„)) 



over T & %i. In this case the observer knows n and the distribution of observations and aims 
to recognise the maximum at the moment it appears. This loose name was attached to the 
problem to stress the contrast with the classical best-choice or secretary problem where no 
information about the distribution is available and the policy is to be based only on relative 
ranks (or, in other version, on record times [0). 

Samuels explained that the correct answer in |^ resulted from yet another coincidence: the 
common optimal policy yields the same best-choice probability in the full-information problem. 
The now threefold coincidence was reinforced by Tamaki and Mazalov |^ who noted that the 



same limit appears in connection with the problem of maximising the inter-record time, as 
studied in |]TI| . 

To justify (|l]) Samuels used Poisson approximation to express the limit values via certain 
multivariate integrals which he evaluated partly analytically, partly using numerical integration. 
He then concluded that this kind of argument does not really explains the phenomenon, because 
the random processes underlying m„ and w„ are of very different nature and do not seem to 
admit a kind of coupling, even asymptotically. As for the coincidence of optimal policies, it 
was derived from the fact that two different mixtures of binomial distributions - one uniform 
in n and another uniform in p - yield the same distribution, see | |29[| . 

In this paper we argue that (|^) and further coincidences are by no means incidental, rather 
exemplify properties of various Markov chains induced by records from the homogeneous Planar 
Poisson Process (PPP). Essentially the same reason which leads in the discrete-time setting 
to the coincidence of optimal policies unravels in the PPP setting as a characterisation of the 
box-area process which measures the predicted intensity of PPP-records in a given rectangle. 
Our explanation to ([l|) is that 

proper Poisson versions of Porosinski and Petruccelli problems with same size- 
parameter t can be reduced to optimal stopping of the same one- dimensional box- 
area process for any value of t. 



We adopt the following well-known framework (also see 10], 0! @]) 0! HHjIH] for similar 



approaches). Consider the PPP restricted to a given rectangle R of area t (with the conventional 
orientation). Suppose the rectangle is scanned from the left to the right by shifting a vertical 
detector and that scanning can be stopped each time an atom of the PPP is detected. Different 
objectives and constraints are considered. 
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(FI): In the full-information problem R is known and the objective is to stop at the highest 
PPP-atom in R. 

(VC): In the vertical cut problem, R is partitioned by a vertical line V drawn through a random 
uniform point selected on the upper side of the rectangle. The observer, who does not 
know V aims to stop scanning at the point highest among the Poisson points in R which 
are to the left from the cut V. 

(HC): In the horizontal cut problem, R is partitioned by a horizontal line H drawn through a 
random uniform point on the left side of the rectangle. The observer aims to stop scanning 
at the point highest among the Poisson points in R below the cut H. The observer does 
not know H but each time an atom is detected she learns if the atom is above or below 
H. 

Let u(t) and w{t) be the optimum probabilities in the VC- and HC-problems, respectively. We 
will show that 

(2) u{t) = w{t) 

and give explicit formulas for the value. The common limit vp will be given interpretations as 
the optimal probability of the best choice in a t = cx) model. 

Generally speaking best-choice problems belong to the province of extremes and records, and 



there is a well-developed theory of these structures, see [T^ and a survey |jTj]. However, 
for evaluating stopping policies one needs to consider records satisfying variable constraints, 
and the theory does not cover this subject yet. 

In brief, our plan is as follows. We start in Section 2 with thorough analysis of structures un- 
derlying the Fl-problem, we present a new complete solution and closed-form formulas, online 
connection to an optimal control problem and give various representations of the best-choice 
probability. A principal novation is the box-area process which we describe as a regenerative 
process, design a EU-representation (exponential-uniform) for the path and prove a character- 
isation via the distribution of the number of visits in an interval. In Section 3 we modify the 
box-area process to adopt it to the the VC-problem, derive an analytical expression for u{t) 
and draw a parallel between the box-area process and the classical Poisson process. In Section 
4 we analyse upper and lower record processes and proceed with three different proofs of (0). 
The relation (0) itself becomes embedded into a series of distributional identities involving ra- 
tional functions in exponential and uniform random variables. In Section 5 we give a sample of 
extensions, reduce the duration problem to the VC-problem and finally give a formula for the 



winning rate, thus fixing a loose end from |12 . 



2 Records, box areas and the full- information problem. 

2.1 Prerequisites. We will consider the homogeneous PPP, which has the Lebesgue measure 
as intensity. The properties of the PPP which will be used without further reference are: 

The number of PPP-points (referred to here as atoms) in each bounded domain has 
Poisson distribution with mean equal to the area of the domain. 

The random variables counting the atoms in disjoint domains are independent. 
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For any rectangle R, projections of PPP atoms a G -R on adjacent sides of R yield one- 
dimensional homogeneous Poisson processes (which are conditionally independent point 
processes given the number of atoms in R). 

For any rectangle R, conditionally on the number of atoms in i?, say the law of PPP 
in R is the same as that of the point process induced by a sample of n i.i.d. points from 
the uniform distribution in R. 

We will use the following notation for exponential integral functions 

I{t, s) := f/—^^, Jit) ■■= I , I{s) = /(oo, s) 

(see p], ||2^ for detailed study of these and other functions related to the incomplete gamma- 
function). 

We consider only rectangles with sides parallel to coordinate axes. Given a rectangle R, an 
atom a G -R is said to be a record if there are no other atoms in R to the north- west of a. The 
part of R to the north-east of a will be called the box attributed to a and its area a (a) will be 
called the box area. 

If two rectangles Ri and R2 have the same area, there is an afiine isomorphism cf) between 
them which respects both the measure and the natural partial order. It follows that the </)-image 
of the PPP in Ri is a version of the PPP in i?2, with same records and box areas. This kind of 
self-similarity is crucial for the models to follow, and the only essential parameter of a rectangle 
will be its area. 

Throughout we denote this basic parameter by t. Different interpretations are possible: in 
case R = [0,t] x [0,1], the parameter will be implicitly understood as a time horizon for a 
'sequence of marked items arriving in a Poisson manner', while for R = [0, 1] x [0,t] one can 
think of [0, 1] as a time scale and of [0,t] as a scale for 'qualities of random items'. However, 
the reader should accept thinking in terms of areas and be prepared for the models like best- 
choice in a square with side-size In case t = 00 we consider PPP in the semifinite strip 
[O,l]x]-oo,0]. 

Denoting Pj{t) the probability of j records in R we have 

where ai{k,j) are signless Stirling numbers of the first kind (= for k < j). This formula 
follows from the analogous fact about random permutations (see e.g. |]IB|]), because if there are 
k atoms in R all their k\ rankings on the vertical scale are equally likely. Two special cases of 
the formula will be most important: 

00 j.k 
k=l 

Many recursions involving records in R are obtained by conditioning on the area in R to 
the left from the leftmost atom, say a, which is also the first (i.e. leftmost) record. When 
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R = [0,t] X [0, 1] this area is just the horizontal coordinate of a. In this hne, we have for the 
number of records a recursion 



Pj{t) = J e'^ ^ ds J pj_i{sx)dx. 



10 Jo 

Exchanging the order of integration this becomes 

(4) pjit) = e-*|V,_i(s) (j(t) - J(s) +log ^) ds 

and shows that all functions Pj(t) are obtained by repeated integration of po(^) = e~* with the 
same kernel. Same recursion in differential form is 

(5) p'm = -p^it) + fp,^i{s) ds, p,(0) = 0. 







Another recursion can be proved by induction: 

(6) ft (*) = e-' /' (-i)'-Vi(-'')-ft-W«- 



10 s 

Starting from po{t) = e~* this yields already determined pi(t) = e^^J{t), then 

t f' -J{-s)e' - J{s) ^ 
P2{t) = e-' / ^ — ^ ds 



and so forth. Note that the power series for pjitys define entire functions thus substitution of 
negative values of t does make sense. 

Remark. The sequence of records can be viewed as a north-west Pareto boundary of the 
Poisson sample. This motivates yet another representations for Pj{t): as a multidimensional 
integral over the value of a bivariate sequence of records of length j, or as a one-dimensional 
integral over the area to the north-west of such a sequence. 



2.2 Probability of the best choice. Suppose an observer learns the configuration of PPP 
atoms by shifting a vertical detector from the left to the right. The objective of the observer 
is to correctly recognise the highest atom in a rectangle R at the moment the highest atom is 
detected. In the full-information problem it is assumed that the observer knows R exactly. 

Formally, a policy is a stopping time adapted to the PPP, and the performance index of a 
policy is the probability of stopping at the highest atom in R. In first turn, we are interested in 
an optimal policy which maximises the probability of stopping at the highest atom. Since the 
highest atom is the last (i.e. the rightmost) record in R it is always optimal to skip non-record 
observations. On the other hand, when a record a is observed furher records can appear only in 
the box attributed to a, and because the configuration of atoms in the box is independent on the 
configuration to the left from a, the box area a{a) alone determines the conditional probability 
law for the number of future records and the law of their configuration up to isomorhism. The 
conditional distribution of the number of records is obtained by substituting a{a) in place of t 
into (^, thus the decision to stop at a record or to skip it should depend only on the box area. 
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Let v{t) be the optimal probability of stopping at the highest atom. Dynamic programming 
approach calls for solving the equation (DP-equation) 

v{t) = / e*^*ds / max{po{sx),v{sx)) dx 
Jo Jo 

which is equivalent to the initial-value problem 

(7) v'{t) = -v{t) max{po{s),v{s))ds, v{0) = 0. 

Jo 

It is immediate from (0) that the solution is unique and at least C^-smooth for t > 0. However 
the equation is difficult to deal with directly, unless we learn how to resolve the max operator. 

A traditional resolution in the spirit of optimal stopping theory is as follows. Consider 
equation po{t) = Pi{t), which is equivalent to the transcendental equation J{t) = 1. There is a 
single positive root tp = 0.804352 . . . and we have 

Po{t) >Pi{t) ^ tKtp. 

Since the box areas can only decrease, this relation implies that we are in the so-called monotone 
case of optimal stopping and by a well-known argument v(t) > poit) ioi t > tp and 

(8) v{t) =pi{t) for t < tp. 

We could have come to the same conclusion by a more insightful method we call coupling. 
Consider a rectangle Ri = [0,1] x [— t, 0] and a smaller rectangle R2 = [0,1] x [—(t — 6),0]. 
Obviously, the records in R2 are records in Ri as well, although Ri may contain some more 
records in the strip [0, 1] x [—t, — {t — S)]. If the record sequence in Ri ever enters R2 it stays 
there forever, in which case the PPP in both rectangles has the same highest atom. Now, any 
stopping policy vr in R2 is also a legitimate policy for R2 and if tt succeeds to pick the highest 
atom in R2, this is also valid for Ri. Since vr can be arbitrary i?2-policy, we have v{t — 6) < v{t), 
i.e. the value function v{t) is increasing. At the same time, po{t) = e~* is decreasing, therefore 
there is a single match-point under the maximum and a minute thought shows that the match 
is at tp. 

It follows that the optimal policy is to select the first record which has the box area not 
exceeding tp, if any. For t <tp it is optimal to exploit the greedy policy which selects the very 
first detected record. 

The DP-equation (|^ can be easily solved by splitting the integral term at tp. With no 
extra effort we can do this in a more general framework. 

Define a threshold policy tTs to be the policy which stops at the first record with box area not 
exceeding s. Clearly, the optimal policy is ntp. The definition also covers the greedy policy tToo- 
(The maximum best-choice probability with tTqc is about 0.51735, attained at t = 1.50286 ■ ■ ■.) 

Warning. This definition is in terms of box areas, thus incorporates the self-similarity 
properties of PPP. Stopping rules akin to 'choose the first atom in R above a given level' are 
not threshold policies in our sense. 

The probability of the best choice with vr^ is equal to the probability, which we denote 
Pi{t, s), that there is a single record in R which has box area not exceeding s. In this case the 
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record is necessarily the last, and it is selected by vr^ while all preceeding records (if any) are 
skipped. By definition, pi (t, s) = pi (t) ioi t < s and for t > s satisfies 

(9) dtpi{t,s) = -pi{t,s)+t-' fp^(^^s)d^ + t-' fpolOde 

Js Jo 

as it follows by considering the first observed atom in R (which is also the first record). The 
boundary condition at s is pi{s, s) = pi{s). Equation (||) is partial but it is easily reduced to 
an ordinary differential equation with the help of the next lemma. 

Lemma 1 Given s > and a constant c suppose a function g is in C^[s, oo[ and satisfies 
equation 

g'{t) = -g{t) + g{0 + ^ , te[s,oo[. 

Then 

(10) git) = g'is)se'Iit,s)+gis). 
where g'{s) = —g{s) + cs~^ . 

Proof. Multiplying by t and differentiating we kill the integral term and reduce the equation to 

(11) tg"it) + it + l)g'it)=0. 
Separating variables yields 

g'(t) = ^se^g'{s). 

Integrating from s to t and matching a boundary condition at s gives the formula. □ 

Remark. Note that g{t) given by (|l^) is always monotone and for t ^ oo goes to a limit 
obtained via replacing I{t, s) by I{s). 

Applying lemma and writing solution in terms of the exponential integral functions, yields 
explicit formula for the performance of vr^ 

(12) pi{t, s) = I{t, s) e'sp[{s) + pi{s) = (e* - 1 - sJ{s)) I{t, s) + e"V(s), t > s. 
For optimal threshold we have J{tF) = 1, therefore 

(13) t;(t) = (e*^ -tF - l)/(t,tF) + e"*^ , t > t^- 

We see that for t > tp the optimal best-choice probability v{t) is a linear transform of the 
incomplete exponential integral. Passing to limit just amounts to taking the infinite integration 
bound: 

(14) VF:={e'^ -tF-l)I{tF) + e''^ 
with the approximate value 0.580164. 
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History and Remarks. The numerical value of vp was found in [0 by extrapolation 
of stopping values from the problem with fixed number of observations n. The exact formula 
for vp first appeared in p8| and is reproduced (with a sign flop) in ||30|. Samuels p9| and 
Porosinski |2^| also derived pi(oo,s) (our (|I2p with t = oo) by computing multidimensional 
integrals. The Poisson formulation appeared in p7| . 
found in 



and a power-series form of v{t) was 
see also [|l7l and Section 2.3 to follow. The box-area approach, formula (0) and 



its derivation are new. Partial differential equations for the value function appeared in and 
| 27| but they were left unsolved, apparently because the time-space invariance of the problem 
was not recognised. 



The transparent similarity of the finite t and t = oo formulas, highlighted by (0) and ([T3|) , 
stress a major advantage of the Poisson framework. Also, the convergence rate of pi{t,tF) to 
vp is better than exponential, determined solely by the convergence of the exponential integral. 
In the fixed-ra framework, the optimal probability decreases to vp, with convergence rate only 
of the order of (see [0). Another distinguished feature of the Poisson approach is that 
solving the stopping problem for arbitrary t essentially amounts to finding the optimum for 
small t, in contrast to discrete-time setting where the solutions differ wildly as n varies. 



2.3 Optimising the threshold. The optimal threshold tp has the property that the 
function dtpi{^, s) has no break at ,^ = s while there is a break for all other thresholds. This 
property characterises tp ClS db root of the equation 

(15) tp';{t) + it + i)p[it) = o 

which results from equating to the derivative 

dtp,{t, s) = I{t, s){p"{s) + {s + l)p'{s)) 

and is most closely related to the differential equation (|11]) of similar form. 

A deeper analysis going above the framework of this paper shows connection of the phe- 
nomenon with an optimal control problem, which becomes substantial when we consider other 
objectives and stopping sets more general than [0, s]. Here, we only establish the property in 
the context of a simple variational problem of finding an optimal switch. 

Write the objective functional pi{t, s) as an integral with compound integrand 

Piit,s)= rp[iOd^+ fdtp,{^,s)ds. 

Jo Js 

Suppose we begin sliding from = along the curve p'i{C,) and at each time s < t can switch 
to and keep sliding along another curve dtPi{C,, s) to ^ = t. Writing the first integrand in the 
form 

P'i(0 = ^(p'i(0ee-^) 

we see that switching at s means freezing the bracketed factor and proceeding with the integrand 

dtPi{^,s) = ^{p[{s)se'), 
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in accord with Lemma 1. 

Direct geometric argument shows that for an optimal switch the integrands must be tan- 
gential to each other at the switch location. Indeed, let A be the frozen factor. The quantity 
Ae~'^^~^ is increasing in A and goes to or becomes unbounded as ^ goes to oo or 0, respec- 
tively. On the other hand, is positive at and has a unique sign change from + to — , 
thus only A > can correspond to optimal switch. If at some location si the integrands meet 
transversally then there must be a further location S2 where they meet as well. Without loss 
of generality we can select S2 close enough to si to avoid further intersection points between 
them. In case si > S2 switching at S2 outperforms switching at si because in this case p'i{C) 
crosses A^~^e~^ from above. And in case si < S2 we improve si by passing to a tangential point 
between si and S2; thus winning a piece of the area squeezed between the intersection points 
and ending up with a larger A. 

A dual argument treats pi{t,s) as a function of the variable A. An optimal value of this 
parameter is then the largest among those values of A which make and A^~^ meet at 
some s < t. 

Equating derivatives of the integrands in ^ and then substituting ^ = s we get (|15D . On the 
other hand, from (j^) we find that for any t 

tp';{t)-{t + i)p'{t)=po{t)-p,{t) 

thus ([T5|) is equivalent to Po(t) = pi{t) and tp is the unique optimum switch location. (In case 
t < tp it is optimal to keep with the first integrand all the way.) 



2.3 Coupling. Coupling allows to consider best-choice problems simultaneously for all 
values of t and leads eventually to a t = 00 model. The following application of the method 



leads to a formula for dtPi{t, s) and, to an extent, unravels (0). 

Consider a rectangle -Ri = [0, 1] x [—t, 0] and a smaller rectangle R2 = [0, 1] x [— (t — 5), 0]. 
We wish to compare performance of threshold policy tTs in Ri and R2 for small 6. 

Suppose t > s. Clearly, when vr^ is applied to -Ri or R2 the outcomes can be different, but 
this distinction is limited to the event A that the first atom in Ri, say a, appears in the small 
rectangle [1 — s/t, 1] x [—t, — {t — 6)], up to a negligible event of probability o{6). In the event 
A there is no stop before the exploration process enters the domain [1 — s/t, 1] x [— 0] and 
then TTs stops at the first available atom. Assumung that A does occur, tTs stops at a and this 
is the correct decision provided there are no further atoms in [0, 1] x [—[t — 5), 0] (which were 
higher than a with probability complimentary to o{6)); i.e. when, essentially, R2 contains no 
PPP-atoms at all. Thus tt^ performs better in Ri with probability 6e~'^st~^. Otherwise, there 
are some further atoms in [0, 1] x[—[t — 6), 0] and vr^ picks the first of them, in which case vr^ fails 
in i?i but may suceed in R2. Conditioning on the number of atoms in [1 — st~^, 1] x [—{t — 6), 0] 
yields probability 



s ~ 



in favour of R2- It follows that 
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Same result in integral form is established by conditioning on the horizontal position of a: 



(16) dtp^{t,s) = ^ ^e«(po(0-Pl(0)d^ 

t Jo 

For t < s, TTs coincides with the greedy algorithm and same argument yields an integral formula 
for the derivative 

(17) P[{t) = t eHpoiO - MO) 

t Jo 

Integration yields, once again, the best-choice probability ([12|) . 

The t = oo model is related to the PPP in the semi-finite 'rectangle' [0, l]x] — oo,0]. 
Although the set of records is now infinite with probability one, the number of records above 
each level —t is finite, and we can therefore speak of a finite best-choice problem embedded in 
the infinite problem (see for details). The value vp is equal to the optimal probability of 



the best choice in the infinite problem. 

2.4 The box-area process. Fix a rectangle of area t and let a be the leftmost record. The 
area to the left from a is distributed like {E — t)^ where E is a. standard exponential random 
variable (the distribution has a defect because in the event E > t the PPP puts no atoms in 
the rectangle). Furthermore, the vertical position of a is uniformly distributed, thus the box 
area of the first record to observe is distributed like {E — t)^ U with U being standard uniform. 

We find it intuitive to think of detector moving at variable speed adjusted to the config- 
uration of records, so that the area of the current box is explored at unit rate. With this 
convention, the time between a and the next detected record is distributed like {E — a{a))j^. 

The random transformation 

(18) t^{E-t)+U 

defines a Markov transition function on nonnegative reals. We define the box- area process to 
be the discrete-time Markov chain with this transition function. Given that the process starts 
at t, its path has the same distribution as the sequence of box areas of consecutive records in 
a rectangle of area t. (Speaking of paths we mean the states visited upon departure from t). 
Each path of the process is decreasing and eventually gets absorbed in 0. 

It is seen that the box-area process is a combination of two classical models. With only the 
first factor present, (|TBp were the homogeneous Poisson process, while setting = we get the 
stick-breaking transformation t ^ tU (which generates a multiplicative renewal process, i.e. 
the exponential of the homogeneous Poisson process). An explicit formula for the transition 
function follows by integrating over the domain {a G -R : a{a) > s} within R = [0, 1] x [0,t] : 

(19) P{t, [s,t]) = J ^^dx e-^d^ = e-'(e' -e' -s J x'^e'^dxj , s<t, 

and the absorption probability is P{t, {0}) = e^*. 

Extending our previous definition define pj{t,s) to be the probability that the box-area 
process has j visits in ]0, s] conditionally on the initial state t (in case t < s we do not count 
t as a visit). In terms of the best-choice problem Pj{t, s) can be interpreted as the probability 
that TTs stops at a record followed by j — 1 further records, in accord with the former definition 
of pi(t, s) in Section 2.2. 
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Obviously, 

Pj{t, s) = Pj(t) for t < s , 
and for t > s the jump-counts distribution is given by the formula 

(20) p,it,s) = ise'pris))Iit,s)+p,is) 

which extends ( |12|) and appears as a solution to the Cauchy problem 

dtPj{t,s) = -Pj{t,s) + r^ f p,{^,t)d^ + t-^ r p^^.iOd^ 

J s Jo 

Pj{s,s) = pj{s), 



in exactly the same way that lead us to ([T^). Computations with ( 120|) are sometimes facilitated 
by replacing the derivative using the formula 

(21) p',{s) = '-^J\HPj-i{0-pAO)d^ 

which can be derived from (^) or proved by analogy with ([T7|). Alternative way to treat the 
derivative is to use recursion (|]), to get the solution in the form 

p,{t,s) = {l-I{t,s))se')p,{s) + I{t,s){{-iy-'p,^,{-s)-e'p,^,s)) 

which also involves a function of negative argument. 

Applying ( pO]) we obtain the probability that vr^ selects some record 

(22) l-po{t,s) = l-e'' + sl{t,s) s<t 

which is also the probability that the minimum box area (which is attributed to the highest 
atom) is less than s. 

Let (j){t, s, x) be the probability that the chain, which starts at t, has the first visit in [0, s] 
within the subinterval [x, s], t > s > x > 0. In extension of (^) we have the relation 

(23) pj[t,s)=f pj-i{x) d^{l - (j){t,s,x)) 

•J 

(differential in x). Distribution (j){t,s,x) satisfies a differential equation of the familiar type 

1 s — X 

dt (f){t, s, x) = -(j){t, s,x) + - / 0(^, s, x) d^ + 



t Js '''''' ' t 

with the boundary condition 

(f){s,s,x) = P(s, [x,s]). 
Solving the equation with the help of Lemma 1 we compute 



d,{l - <j){t, s,x)) = I{t, s)e' + (e-^ - s I{t, s)) f e^y-^ dy . 

Jx 
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The function dx{l — s,x)) is the density of the box area of the record selected by vr^. 
Therefore, the probabihty of best choice has another integral representation which is a special 
case of 



(24) pi{t,s) = / e-"d,(l-0(t,s,x)). 

Jo 

One sees that it is the same as ([l^) by explicit integration based on the identity 



e-^x r e^y-^ dy = J{s) 

Jo Jx 



Setting t = oo can be interpreted as a one-point compactification of the state-space of 
the chain. This corresponds to PPP records in R = [0, 1] x [— oo, 0] and provides natural 
interpretation to t ^ cxd limits. Thus Pj{oo, s) is the distribution of the number of records with 
box-areas less than s. And pi{oo,s) is the probability of best choice in the infinite problem 
when lis is exploited. 

Note that the stopping set of vr^ is a compact set [0,s], when viewed from the box-area 
perspective. Foe R as above, the corresponding set is an infinite domain between a hyperbola 
and the right-side of R. 

Remark. Formula ( pOD has a touch of mystery. Typically it is hardly possible to directly 



express the events underlying probabilities (20) and the like in terms of the PPP configuration. 



For R = [0, 1] X [—t, 0] the second term in the RHS of ( PDj ) could be interpreted as the probability 
of j records above —s, but the first term can be negative, thus it is not at all obvious that the 
sum is positive. 

In case of po{t,s), i.e. for probability of no box areas less than s, a smooth explanation is 
possible, namely via location of the highest atom. Note that is the chance for no atoms (thus 
for no records) above —s. Given this event the ordinate of the highest atom a has substochastic 
density e''^'^l{^g(s_t)}d^. Given the height ^, a must be located to the right from 1~ s/{^ + s), to 
guarantee the box-area not exceeding s, and integrating out yields e"'' — s I(t, s), a probability 
complimentary to (p2D. 



2.5 Path distribution and the EU-representation. In this section we consider semi- 
finite rectangle R = [0, 1] x [— oo, 0] and supply random variables associated with the PPP 
configuration above —t with subscript t. 

Let {Aj) be the sequence of box areas of the records in R enumerated in time- reverse order. 
That is to say, Ai is the box area of the last record (= the highest atom), A2 of the record 
before the last, etc. Let (Ajt) be a finite initial subsequence of (^4^), corresponding to the 
records above —t. Coupling allows to identify {Ajt) with the collection of states visited by the 
box-area process started at t {t itself excluded). As t ^ 00 the sequence (Ajt) converges to 
(Aj) with probability one. 

Warning. As a point set {Ajt) is a truncation of (^4^), but it is not {At) intersected with 
[0,t]. It it a formidable if at all realistic task to directly derive distribution of {Ajt) from the 
distribution of {At). 

With the convenience I{t, s) = for t < s and P-i{s) = the distribution of A^t is 

1 - P{Aj,t <s) = 
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fc-1 



j=0 



fc-1 s 



fc-1 



fe-i(0-p.(0)de + Ep.(^) = 



as a consequence of ( pO]) and (pT|). 

There is a representation for the time-reversed path [Ajt) in terms of standard exponential 
and uniform random variables, which we call the EU-representation. Note that the transform 
t —>■ {E — t)^U which defines the box-area chain is related to the following (distributional) 
construction of the sequence of records in a finite rectangle: skip an exponentially distributed 
area from the left and then break off a uniform portion of the rectangle from below. The inverse 
operation amounts to skipping exponentially distributed area from the top and then breaking 
off a uniform portion from the right. 

The inverse operation makes sense also in semi-finite [0, 1] x [—00, 0], when we identify the 
uniform breaking with selecting a random point on the upper side of the region south-west from 
record. Calculating the box areas we see that (Aj) become jointly represented as 



with (Ej), (Uj) being jointly independent exponential and uniform random variables, respec- 
tively. In the event 



same representation is valid for {An, . . . ,Akt)- For t ^ 00 the constraint (|26|) becomes void 
and we arrive at an interesting conclusion. 

Theorem 1 The distribution of random variable 



(25) 




(26) 





is given by 




fc-1 



Examples. We compute 



P{A, > s) 
PiA2 > s) 



e 



-'-sl{s) 

s /(s) J(s) + e' I{s) - I{s) - s I{s) + + e"" J(s). 



13 



The probability of best choice becomes a difference representation 

Pi(oo, s) = P{Ai <s<A2) = P{A2 > s)- P{Ai > s) = {e' ~s J{s) - l)I{s) + e'' J{s). 

Note that marginal distributions of Ai and A2 alone do suffice for this computation, because 
we always have Ai < A^. 

Remark. Direct computation of the distribution of from the EU-representation works 
smoothly only for k = \. Already for k = 2 the computing requires skillful multidimensional 
integration which was performed in p8| , and (the integration could be a bit simplified 



by expressing the event via marginals and using an explicit formula for the density of sum of 
exponential variables, as found in Feller's textbook). 

2.6 Characterisation. We will show that the distribution of record-counts (|^) uniquely 

characterises the box-area process as a Markov chain. 

Theorem 2 There exists a unique Markov chain on [0, 00) which has absorbing state 0, 
decreasing paths and for any initial state t the distribution of the number of jumps on (0, t) 
given by (|^). 

The idea is to show that linear combinations of functions Pj{-) span C[0, t] for any t. With 
a Stone- Weierstrass argument in mind, we see that the functions separate points and linearly 
independent, thus span an infinite-dimensional space. However it is not clear whether the 
set of finite linear combinations of functions Pj{t) (or of power-series Pj{t)) is closed under 
multiplication (apparently not). We will resolve the complication by proving an inversion 
formula, expressing quasi- monomials as infinite series in the pj's. This will imply that infinite 
series in functions e* pj (t) do form a ring. 

Lemma 2 (inversion formula) For any j 

e-'- = j:^2{k,j)j\{-ir-'Pk{t), 

J- k=j 

where o'2{k,j) are Stirling numbers of the second kind. 



Proof. For any m the matrix (a2{k, {—!)'' A, is inverse to {o'llk, j)/k\)'[^:^^, and both 

matrices are lower-triangular. We need to show that formal inversion of the analogous infinite 
matrices makes sense, i.e. that the involved series converge. 

Sphtting the sum in (Rf) at m, swapping summations and using the finite inversion we obtain 



J2a2{k,j)j\{~l)''-^p,{t) 

k=j \i=k ''■ i=m+l 

J- k=j i=m+l ''■ ''■ 



14 



Denote the rest term by p^. Pulling out the homogeneous factor, we get 



=~^i ^! ^ ~ (m+ 1)! V (m + 1)! ^ (m + 2)! m + 2 ' ' / ^ 
1 H ^ + 7 ^77 ^ + . . . < 7 -TT ■ const 



(m+l)!\ m + 2 (m + 2)(m + 3) "7 (m+1)! 

where the constant does not depend on m. By definition, a2{k,j) is the number of partitions 
of a set with k elements in j parts hence it does not exceed the number of labelled partitions 
in at most j parts, which is j'^. Using the bound, we estimate 

Pm < const ■ —— 2^J < const ■ —— 

(m + 1)!^. (m + 1)! 

where the constant depends on t and j but not on m. Obviously, pm — »■ as m — >• oo, hence 
the series in the inversion formula converges to the conjectured e~^P / j\ □ 

Example. The simplest instance of the inversion formula is 

e-H = piit) - p2{t) + p^it) - . . . 
For higher order monomials the coefficients in the series are unbounded. 

Proof of the Theorem. By the Stone- Weierstrass theorem, linear combinations of monomials 
g-s^fc (quasi-polynomials) are dense in C[0,t]. By the inversion formula, each quasi-polynomial 
in s is representable as a converging series in p^'s, hence these functions span C[0,t] as well. It 
follows that any finite measure /i on [0,t] is uniquely determined by the 'moments' 

t 

Thus, if a Markov chain has transition measure /i(t, ds), decreasing paths, and the distribution 
of jump counts as given by (|^), we must have 

Pk{t) = I Pk-i{s)fi{t,ds) 
Jo 

which determines fJ^it,-) inambiguously. Because this holds for arbitrary t, the transition func- 
tion must coincide with that for the box-area process (|19|). □ 

Remark. With more work, the path monotonicity condition in the theorem can be omitted. 



3 Random horizon problem — vertical cut. 

3.1 The vertical cut problem. Fix a rectangle R of area t and suppose it is partitioned by 
a vertical line V which splits off f/t units of the area from the right, where f/ is a standard 
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uniform random variable independent of the PPP. Suppose the rectangle is scanned from the 
left to the right, and the objective of the observer is to maximise the probability of stopping at 
the atom which is highest among atoms in R to the left from V . 

In the VC-problem the observer knows R and the distribution of V , but the exact position 
of random horizon is unknown. A selection policy should be adapted to the PPP but not to U . 

We make distinction between two versions of the problem. According to version I, the 
observer always knows whether the vertical cut has been approached or not; and, of course, 
stops scanning when V is reached. In version II the observer never learns the position of V . 

The additional information in version I is worthless because there is no essential updating 
of the position of V , and thus the optimal policies are the same. However, formulas for the 
conditional distribution of the predicted number of records are different and only version I has 
a smooth formulation in terms of box areas. We will consider here version I but will return to 
version II in Section 5 on different occasion. 

As in the full-information problem, the shape of R does not matter because the affine 
isomorphism of rectangles with same area also respects a uniform random cut. Each time a 
record to the left from V is detected the conditional distribution of U becomes scaled uniform, 
and this implies readily that an optimal policy must be adapted to the box-area process in R 
which is to be truncated properly to take into account approaching V . 

The exposition to follow is based on same ideas as in the full-information problem, therefore 
we omit many details. Loosely speaking, it is all much the same, but J(s, t) must be replaced 
by the exponential integral of degree 2 

h{t,s):= [ ^ de = — - V - ^(^' ^) ' ^2(s) := /(oo, s). 

Js q St 

Let qj{t) be the probability of j records to the left from V. Conditionally on k atoms in R, 
the distribution of the number of atoms to the left from V is uniform on {0, . . . , fc}, hence 



k=0 v'" ' i=0 



In particular. 



where h{k) :=l + 2 ^ + . . . + k ^ is the harmonic number. Two most important cases are 

qoit) = —^, qiit)= ^ ^. 

A basic relation with functions (^) is 

(28) qj{t)=t-' fp,is)ds. 

<j 

as one sees by averaging over the random horizon. Yet another relation appears when we write 
(^) in the form p'j{t) = —pj{t) + qj-i{t) and compare it with what is obtained by differentiating 
Pj(t) using recursion (H): 

q,{t) = ^ {{-iy-'pj-i{-t) - p,.,{t) 
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The counterpart of becomes 

(29) q'jit) = -(1 + t-')q,{t) + q,^,{s) ds . 

with the newly appearing factor (1 + t^^) reflecting the risk of approaching V at probability 
rate dt/t. But we also have another differential equation which follows from (P^) 

(30) tq'^{t)=p,{t)-q,{t). 

Let u(t) be the optimal probability of stopping at the highest atom to the left from V. The 
DP-equation (dynamic programming) for u becomes 



u'(t) = -u(t)(l + t-^) max(qo(s),u(s))ds, u(0) 

Jo 



0, 



and is resolved by the same method we applied to (|^. Define tp = 2.11982 ... to be the unique 
positive root of any of four equivalent equations: 

Qoit) = qi{t) 
-J{-t) - e-'J{t) = 1 - e"* 
p,{t)-po{t) = l-J(-t) 

oo 1 oo J.k—1 
j=2 J k=j+l 

The uniqueness follows by monotonicity and for the same reason 

qoit) > qi{t) t <tp. 

It follows that we are again in the monotone case of optimal stopping, hence an optimal policy 
is the threshold policy vr^^, prescribing to choose the first record to the left from V with the 
box area less than tp (if any). 

Let qj{t, s) be the probability of j records to the left from V with box areas less than s, in 
a rectangle of area t. Then qi{t,s) is the probabihty of best choice with tt^ and the optimal 
probability equals u{t) = qi{t,tp). 

The relevant Cauchy problem becomes 

(31) dtqj{t,s) = -qj{t,s){l + t-^)+t-^ f%j{^,t) d^ + ^qji^d^, t>s 

J s Jo 

with the initial condition qj{s, s) = qj{s). The analogue of Lemma 1 carries over in the form of 

Lemma 3 Given s > and a constant c suppose a function g is in C^ls, oo) and satisfies 
equation 

g\t) = -g{t) (^1 + 1^ + i g{^) de + ^ , t G [s, oo[ . 

Then 

g{t)=g\s)s^e^h{t,s)+g{s) 
where g'{s) = {cs^^ — (1 + s^^) 9{s)) . 
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and leads to the solution 



(32) qj{t,s) = l2{t,s)s'^e'q'j{s) + qj{s) t > s. 

Using (|30| ) the formula in case j = 1 takes form 

gi(t, s) = hit, s) s{pi{s) - gi(s)) + gi(s) = 

h{t, s){s.]{s) + eV(-s) + J(s)) - -(J(-s) + e-V(s)) 

s 

which is simplified in the limit, when we express l^^s) via /i(s), as 

gi(oo, s) = -l{s){J{-sY + J(s) + sJ(s)) + e-V(s) 

(probability of best choice with vr^ in the infinite problem) . The optimal best-choice probability 
is obtained by substituting s = tp: 

u(t) = qiit^tp) = hit, s) e' s {sJ{s) - + 1) + ^ ~ 



and for t = oo this further simplifies to 

(33) gi(oo, tp) = litp) (e*^ - tpj{tp) - l) + e"*^ J(tp) . 

which is also the optimal probability of best choice in the infinite VC-problem. The right-hand 
side of the last formula is the Petruccelli's value vp. 

The function dtqi{t,s) has a break at t = s for any s ^ tp. Similarly to tp in Section 2.3 
threshold tp can be interpreted as an optimal switching location where q'i{s) becomes tangential 
to a curve Ae~*s^. The 'no-corner' condition at tp characterises this threshold as a unique root 
of tq'((t) + {t + 2)q[{t) = qo{t) — qi(t), and this equation is equivalent to qi(t) = qo(t) because 
qi satisfies the differential equation 

tqi{t) + {t + 2)q[{t) = qo{t)~q,{t) 

which in turn is a consequence of (p9|). 



Remark. The right-hand side of (|33|) appeared first in |21| as the limit best-choice prob- 



ability in fixed-n partial information problem, as described in the Introduction. 

For the random horizon problem, our argument seems to be the first complete proof that 



Petruccelli's formula also yields qi{oo,tp). Porosinski attempted to show that vp is the 
limit in the discrete-time problem with uniform random number of observations, but his argu- 
ment has a gap. On bottom of p. 325 he confused conditional and unconditional best-choice 
probabilities and left without proof an equality on bottom line 2 (which was nevertheless correct 
by coincidence with the Fl-problem, as discovered by Samuels p^). Samuels expressed 



qi{oo,tp) as a multidimensional integral and partly using numerical integration justified the 
value with the precision Mathematica can give. 

3.2 Box-area process. The box-area Markov chain related to the the VC-problem is the 
sequence of box areas associated with the records to the left from V. To make clear distinction 
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with the process introduced in Section 2.4 let us call the new chain Q-process, and the basic 
box-area process the P -process. 

One-step transition of the Q-process is given by the scheme 

t ^ {t - E)Ui l{E<tU2} 

where i?, f/i, U2 are independent exponential and uniform random variables, respectively. This 
can be given a continuous time interpretation, as follows. Starting with area t, during a period 
of length E the area is explored at unit rate unless the process gets absorbed in the meantime, 
with absorption probability rate being ds/s. If the absorption does not occur, at time t — E 
the new box-area is obtained by stick-breaking (t — E) ^ (t — E)Ui. 

We will denote (-Bjt) the time-reverse sequence of states visited by the Q-process conditioned 
on start at t and {Bj) the sequence associated with records in the semi-finite rectangle. To 
unify exposition, let us consider the semi-finite compactified rectangle [0,1] x [— oo,0], with 
obvious interpretation of the random vertical cut. The sequence {Bj) is associated with records 
to the left from V and the sequence {Bjt) with records which are also above —t. This is just 
the coupling approach for the VC-problem. 

Since Q-process is obtained by truncating the set of records, the sequence [Bj] (or (Bjt)) is 
a random shift of [Aj) (respectively {Ajt)) by a few positions. However, there is no transparent 
distributional connection between the processes. 



Digression. Given (Aj) the shift-size depends on the full sequence. This claim is based 
on the following fact about the shape of record sequence (see and 0. In the (unlikely) 
event that a fixed rectangle contains a large number of records they tend to concentrate near 
diagonal, thus a random cut splits away a large part of {Aj), which is certainly not typical. 
Whatever the values of, say Ai, . . . , A^, the number of records in the rectangle is likely to be 
moderate, and the cut isolates a few of the records. Thus looking at a finite piece of (Aj) does 
not allow to definitely decide how many of the entries should be removed to get (Bj). 



Multiplying the integrand in ([T^) by 1 — ^/t we compute the transition function for the 
Q-process as 

^/ r IN e'-^ + t-l-s 
Q{t, [s,t]) = 

and the absorption probability is Q(t, {0}) = qo(t) = (1 — e^*)/t. Another piece of transition 
function is 

(34) Q(t,]o,3]) = ^— ^1^, t>s 

and they are related through Q{t, ]0, s] ) -|- Q{t, [s,t]) = 1 — qo(t) for t > s > 0. 
The transition function satisfies a differential equation 

1\ min(s,t) 



dtQ{t, ]0,s]) = -Q(t, ]0,s]) (1 + ^) + 



t 



obtained by conditioning on the first observation. The equation is valid for arbitrary t and s and 
can be solved directly by separating variables and variation of constant. For future reference 
we note that Q{t) = Q{t, ]0, s]) also satisfies 

(35) tQ" +(t + 2)Q' + Q- l{t<,} (t) = 
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as obtained by differentiation. 

A Q-analogue of s, x), the probability that the process has its first visit on ]0, s] within 
subinterval [x, s] is 

ip{t, s, x) = hit, s)ds Q{s, [x, s]) + Q{s, [x, s]). 

and an integral representation of best-choice probability qi{t, s) follows as in Section 2.4. 

The distribution of counts qj(t) uniquely characterises the Q-process. One way to show this 
is to use an explicit inversion formula which represents monomials e~^P/{j + 1)! as series in 
qjitys, namely with coefficients cr2(j, k)k\ — <J2{j, k + l){k + 1)!. But once we have established 
a similar result for the P-process a reduction is possible. 

Theorem 3 There exists a unique Markov chain on [0, oo) which has absorbing state 0, 
decreasing paths and for any initial state t the distribution of the number of jumps on [0, t] 
given by (|27|). 

Proof. Let us show that the functions qj{s),s G [0,t] span a dense subspace in C[0,t]. Inte- 
grating the inversion formula in Lemma 2 we obtain 

oo 

tk{t)=J2'^2{j,k)k\{-lf-^q,{t) 

j=k 

where 

1 /■* s^ 

= 7 

But since i^s are representable via g^'s, same applies to quasipolynomials which can be recov- 
ered by recursion 

^,(t) = -e-V-VA;!+z,_i(t). 
The density claim follows, and the rest is as in the proof of Theorem 2.n 

3.3 Coupling. We will derive now a formula for dtqi{t,s) to demonstrate some combi- 
natorics behind (P^. Consider R = [0,1] x [— cx),0] sectioned by a random vertical cut V, 
identified with uniform r.v. U. 

Suppose TTs is applied to finite rectangles Ri and R2 as in Section 2.3. The outcomes in Ri 
or i?2 can be different only in the event B that the leftmost atom in Ri, say a, appears in a 
random rectangle [1 — s/t, U] x [—t, —{t — 6)] (which is an empty set in case U < 1 — s/t) in 
which case tt^ restricted to Ri selects a. 

Assuming that B occurs, tTs does right if there are no further atoms in [0, U] x [—(t — 6),0], 
as it happens when U separates a from these atoms. Conditioning on the total number k of 
atoms in Ri we find that the best-choice probability in favour of the larger rectangle Ri is 

where the factor s/t stays for the probability of f/ > 1 — s/t. 
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On the other hand, the advantage for R2 appears when B occurs, some further atoms are 
located to the right from a and to the left from U, and the leftmost of these atoms is the highest 
in [0, U] X [—t, 0]. Conditioning on the total number k of atoms in Ri yields probability 

to the advantage of Hg in R2. Putting two parts together yields the derivative 

g-t 00 g-t 00 ^k 

which is a (quasi-) power-series form of the formula 

f>~i /•min{s,t) 

(36) d,q,{t,s) = —J^ e«e(g,-i(0 -9.(0)de 

analogous to (|TBp . Integrating we re-derive 



3.4 EU-representation. The EU-representation of the path for the Q-process differs from 
that for the P-process only in the first step of the algorithm: obtaining Bi involves uniform 
breaking then exponential skip and repeated breaking, with the first break corresponding to 
the vertical cut. It follows that (Bj) can be jointly represented as 



with the same notation as in (25). The first members of this sequence coincide with Bkt as long 
as the first bracketed factor does not exceed t, and the finite sequence (B^t) converges to {B^) 
almost surely. 



Theorem 4 The distribution of random variable (|37| ) is given by 



P{Bk >s) = -his) ree«gfc(Ode + E^^(«) 
-'o J=l 



Example. Expressing his) via /(s) we have 
P(5i>s) = il-e' + s)Iis) + e-' 

PiB2>s) = e"il + Jis))-Iis)il~e' + e'Ji-s) + Jis) + s + sJis)) 
and a difference representation of the best-choice probability follows via 

qiit,s)=PiB2>s)-PiBi>s). 



3.5 Duality. There is a wonderful duality between P- and Q-processes which reveals 
as coincidence of probabilities of record counts in some finite rectangles and the semi-finite 
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rectangle R = [0,1] x [— oo,0]. A consequence is a series of coincidences in related stopping 
problems. 

Recall that when (Aj) and (Bj) are considered as functions of the same record sequence in 
R we have Aj < Bj. On the other hand, from the EU-representations of the sequences follows 
that if we construct {Aj) through Ej, Uj then a new sequence {B'j) defined by 

5; = A,+i-^i(i-f/i...f/,+i) 

has the same distribution as {Bj). It follows that for any s > 

P{Ai > s)< P{Bi > s)< P{A2 > s) < P{B2 > s) < ... 

which means that sequences {Aj) and (-Bj) are stochastically interlacing. 
Because 

Pj{(X), s) = P{Aj+i > s)- P{Aj > s), 
qj{oo, s) = P{Bj^i > s) - P{Bj > s) 

we can expect that for certain values of s we have 

(38) qj{oo,s) = pj{oo,s) 
and for some other s we have 

(39) qj{oo,s) = pj+i{oo,s). 

We stress that the quantities involved are related to record counts in the infinite R. The miracle 
is that the values of s which solve the equations can be identified as the roots of analogous 
equations involving record counts in a finite rectangle. 

Theorem 5 For positive s equation (pSD is equivalent to gj„i(s) = qj{s). Similarly, equa- 
tion ( P^D is equivalent to Pj{s) = pj+i(s). 

Proof. The equations relating two kinds of functions are 

(40) p'^{t) = -p^{t) + q^_,{t) 

(41) tq'j{t) = Pj{t)-q,{t) 

(the second follows from the definition of qj). Expressing l2{s) via I{s) and using (^T]) we find 
from (H) 

qj{oc, s) = pj{s) - I{s) e' s{pj{s) - qj{s)). 

Now if qj_i{s) = qj{s) holds then by ( ^T]) and (PPj ) also ( P5| ) is valid, and vice versa. Same 
argument works for Pj{s) = pj^i{s).0 

Example. First of all, qi{oo,tp) = pi{oo,tp). That is to say, the optimal pohcy in the 
VC-problem has the same best choice probability fp in both VC- and FI- problems. Another 
coincidence is go(oo,ti?) = pi{oo,tF), saying that the probability that no record is selected by 
TTtp in the VC-problem equals the optimal best-choice probability in the Fl-problem. 
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Remark. Unwillingly, Porosinski proved that pi(oo,tp) coincides with Petruccelli's vp. It 
is this coincidence which vualised a gap in his argument for qi{oo,tp) = vp, see ||2^ and p5| . 



It is not hard to show that equation qj{t,s) = Pj{t,s) always has a solution s for all t 
sufficiently large. Explicitly, for j = 1 the equation becomes 

(Qiis) - qo{s))I{t, s) = —sq[{s) 

and has a solution at least for t > 3. Analogous fact is also valid for the finite— t counterpart 
of (p9|). These solutions depend on t but they converge to the solutions characterised by the 
theorem exponentially fast. 

3.6 A digression. In reply to Samuels' challenge to explain the coincidence vp = pi{oo, tp) 
we feel that there are indeed good reasons to further seek for an explanation but we will not 
dwell thereon. Instead we will show that the phenomenon is not isolated and even a stronger 
coincidence holds for ... an ordinary Poisson process. To stress the similarity we will use in 
this subsection notation confronting with the rest of the paper. 

Consider the homogeneous PP on the positive half-axis, scanned from finite t or cxd to 0. 
Let V he Si standard exponential r.v. independent of the configuration of atoms. The number 
of atoms within [0, s] has distribution Pk{s) = e~*s*^//c!, and because occurence of atoms to the 
right from s does not affect the configuration to the left from s we have trivially Pk{t, s) = Pk{s). 
Given V < t the conditional distribution of the number of atoms on {V, t) is 

1 If* e~* t''^^ 

— * W = « = — WTTy. 

and the distribution of the number of atoms on [V, s] is 

1 - e"-' 

qk{t,s) = zTlkis). 

i — e ^ 

Observe the identity Pk{oo, s) = gfc_i(oo, s). There is no need to write formulas: this follows 
from independence and the fact that the leftmost atom Ai has the same distribution as V. 
Furthermore s = k is the unique positive root of pk{s) = Pk-i{s) and s = A; + 1 is the unique 
positive root of qk{s) = qk-i{s). 

We have therefore 4-fold coincidence 

- • 

Pji^J) =Pj^i{oo,j) = gj_i(oo,j) = gj_2(oo, j) = ^ -y- 

In optimal stopping terms this reads as follows. Denoting Aj the jth smallest atom and Bj the 
jth smallest atom among the atoms to the right from V the rule 'stop at the first atom to the 
left from j' is optimal for recognising Aj, optimal for recognising -Bj-i, and suboptimal but has 
the very same performance for recognising Aj^i and same for Bj^2 (when the value of index j 
makes sense). 

Optimality of threshold j for stopping on Aj was derived by Bruss and Paindaveine in a 
related context of optimal stopping at the jth last success in a sequence of independent trials. 
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Example. For j = 1 and j = 2 there is a relation to the Poisson versions of the 'classical, 
no-information secretary problem' and the 'no-information secretary problem with uniform 
random horizon' (see p5| and for discrete time formulations). Suppose the observer of 



PPP in [0, 1] X [—00,0] exploits a policy 'stop at the leftmost record in [s, 1] x [— oo,0]'. This 
kind of policy is of 'no-information' type in the sense that it is adapted to the one-dimensional 
process of record times, making a decision independent on 'actual value of item but solely 
on its relative rank'. If the objective is to pick the last record, the optimal s is e~^, and if 
the objective is to pick the last record before random vertical cut the optimum is at e~^, as 
everybody knows (and can derive either directly or from results for the discrete-time setting). 
To put the problem into framework of this subsection, recall that the projection of the set of 
records onto horizontal axis is a PP (of record times) to the intensity dt/t; thus applying the 
— log transform we obtain a homogeneous PP on the positive half-axis, and the cut becomes 
an exponential r.v. 

So we have pi(oo, 1) = po(oo, 1) = go(oo, 1) = e^^ which means that the optimum best- 
choice probability in the classical problem equals no-stop probability in this problem and also 
equals no-stop probability with same policy in the random horizon problem. And gi(oo,2) = 
^2(00, 2) = pi(oo,2) = go(oo,2) = 2e~^ reads as boring as: the optimum probability in the 
random horizon problem equals the optimum probability in the problem of stopping at the 
second-last record, equals the best-choice probability with same policy in the poissonised clas- 
sical problem, equals the no-stop probability with same policy in the random horizon problem. 

A good occasion to celebrate the 40th anniversary of secretary problems. 



4 Partial information - horizontal cut. 

4.1 Motivation and setup. We start with a Poisson version of Petruccelli's 'partial informa- 
tion' problem. Suppose the observer aims to select the highest PPP atom in a finite rectangle 
R = [0, t] X [9,9 — 1] of known shape but with unknown vertical position 9. Suppose the online 
information of the observer consists of the PPP configuration in R (but not outside the rect- 
angle), to the left from the detector. Evaluating a policy by its worst-case performance, the 
question of interest is about the maximin policy and maximin probability of best choice. 

A minimal sufficient statistics for ^ is a pair (X, Y) where X is the vertical position of the 
lowest atom, and Y is the vertical position of the highest atom to the left from the current 
position of detector. From spatial independence of the PPP and the nature of the performance 
index follows that we can restrict consideration to policies adapting decisions to these variables. 
A newly appearing feature is that we need to take into account not only the records we consid- 
ered before, which are the upper records, but also lower records (such that there are no other 
atoms to the south-west), because these are exactly the observations necessary to update the 
information about R. 

The problem has obvious shift-invariance in the sense that performance of a policy vr when 
^ = ^0 is the same as performance of a (properly defined) x-shift of tt when 9 = 9q + x, for 
any x. Invoking the 'Hunt-Stein invariance principle' of statistics one sees that we can further 
restrict to invariant policies, whose performance does not depend on the unknown parameter. 
Since a shift-invariant function of {X, Y) depends in effect only on the range Y — X, the range 
and the horizontal position of upper record are the sole parameters of interest when such a 
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record is detected. 

Analysis of invariant policies and related structures is the subject of this section. Because 
performance of invariant policy is independent of 6*, we lose no generality when assuming that 
the rectangle is standardised to R= [0, t] x [0, 1]. 

When an upper record with horizontal position s is detected, the conditional distribution of 
Y given the range r = Y — X is uniform on [1 — r, 1] . It follows easily that the distribution of the 
number of forthcoming upper records is ^^((1— r)(t — s)) with qj{-) as in Section 3 (by symmetry 
same applies to lower records). Repeating the familiar argument, the optimal decision whether 
to stop on upper record or not should be based on the criterion {1 — r){t — s) < tp. 

This suggests that (1 — r)(i — s) is a proper analogue of the box area from the VC-problem, 
and motivates the following definition. For R = [0,t] x [0,1] we call 1 — r the corange and 
the quantity (1 — r)(t — s) the corange-box area. The definition extends obviously to arbitrary 
rectangles. We stress that the corange-box area attributed to an upper record a is determined 
via a and the adjoint record, i.e. the rightmost lower record to the left from a. 

So does the coincidence of stopping pohcies imply coincidence of best-choice probabihties? 
It is a 'yss' we wish to show, but the correct answer in the problem as we formulated it is 'no' 
for a very simple reason: the initial state in the partial information problem is not t. To be 
precise, speaking of the 'initial state' is inapropriate because the range is not defined before 
the leftmost atom in R is detected. In fact, the first observed atom plays a special role: while 
being a unique upper and lower record, it serves as a cut which splits R in two subrectangles 
supporting independent streams of upper and lower records. For conformity with the VC-model 
we shall assume that the range is and the corange-box area is t when the observation starts, 
this is equivalent to assuming that we start with unknown random reference value - the vertical 
position of an observation which is not counted as a record, but must be taken into account 
when establishing if a PPP atom is a record. 

The final step in formulation of our model is swapping the subrectangles resulting from the 
random cut, without changing the orientation. The cutting line becomes the bottom of new 
rectangle while the bottom and the top sides merge into a new random cut. 

The reason for this surgery is threefold. Firstly, we avoid considering two disjoint rectangles. 
Secondly, when R is fixed requiring that a policy should be range- adapted is somewhat artificial 
and it is much more intuitive to think of the problem where the actual coordinates of atoms are 
observed, despite a bit nebulous reward function - probability of best-choice under unknown 
reference value. Finally, we make upper and lower records converge rather than diverge, and 
this point is crucial for a. t = oo extension of the model. 

To summarise, our final formulation of the HC-problem is this. A fixed rectangle R of area t 
is sectioned by a random uniform horizontal cut H. An observer knows it! and the distribution 
of H but not position of the cut. An upper record is defined to be an atom a which is below H 
and is higher than all atoms below H to the left from a; and a lower record is defined to be an 
atom a which is above H and is lower than all atoms above H to the left from a. Each time an 
atom a is detected, the observer learns the coordinates of a and also lerns whether a is above 
or below H. The objective is to recognise the last upper record at the moment it is detected. 

The corange attributed to upper record a is the vertical distance between a and adjoint 
lower record 6, or the vertical coordinate of a if 6 is not defined. All rectangles with same area 
are affinely isomorphic, and the isomorphism respects the PPP, a uniform horizontal cut, and 
the structure of upper and lower record processes. At each stage the conditional distribution of 
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H is uniform within the corange interval spanned on the current upper record and its adjoint. 
An optimal policy, say vftp, stops at first atom which has corange-box area less than tp. 

4.2 VC=HC: quick proof. Apparently, the most complex and confusing feature in the 
HC-problem is that both upper and lower records affect the state. Let us look at the evolution 
of the corange in details. Start with R = [0,t] x [0, 1], thus the initial range is and corange 
1. The waiting time for the first change is a truncated exponential r.v. which is related with 
the leftmost atom a to detect. The vertical position of a, say Z, is uniform, independent 
of H, thus the new range has the same distribution as a spacing, i.e. the size of interval 
between H and Z, and the new corange has the same distribution as max {1/1,1/2) for two 
uniform r.v.'s. It follows that one-step decrement of the corange-box area is described by 
scheme t — > (t — E)^ max {1/1,1/2). Independently of the decrement, Z < H or Z > H with 
same probability 1/2 , by exchangeability. In the event Z < if we have an upper record, and 
a lower record otherwise (the upper records occur below H). It is seen that an upper record 
occurs after a geometric number of lower-record observations, provided the corange-process is 
not absorbed at in the meantime. 

This description allows to write a DP-equation for the best-choice probability w{t). Using 
the form of vfj^, 

1 i'mm{i,tp/t) 1 /-l 1 /-l 

w'{t) = —w{t) + - / qo{tx) dx"^ + - / w{tx) dx"^ + - / w{tx) dx'^ . 

2 Jo 2 Jmm(l,tp/t) 2 Jo 

Integration is over corange decrement x having the max (?7i, ?72)-distribution x^; the third 
integral term stands for the event that the first atom to observe is a lower record, the first and 
second integral terms stand for the events that the first observation is an upper record and it 
is selected or skipped, respectively. It is instructive to put DP-equation for the VC-problem in 
similar form 

rmin{l,tp/t) rl 

u'{t) = -u{t){l + r^) + qo{tx)dx+ u{tx)dx. 

Jo Jmin{l,tp/t) 

To see that the equations are equivalent assume t > tp, substitute x = and differentiate. 
This yields same tu" = —{t + 2)u' (recall that it was tv" = —{t + l)v' in the Fl-problem). 
From optimality of tp follows that solutions coincide for t < (as can also be seen from the 
equations directly) and both u', w' are equal at tp and continuous, thus passing to higher order 
differential equation does not alter solution. 

Although this argument offers a little of an explanation, the promised coincidence (H) follows. 

4.3 Corange-box area process. We define corange-box area process only for upper-record 
observations. Thus between two upper records, arbitrarily many lower records can contribute 
to the change of state. 

Theorem 6 The corange-box area Markov chain asociated with the HC-problem has the 
same distribution as the Q-process in the VC-problem. 

First proof. The number of visits in each interval ]0,t] has the same distribution qj{t) as for 
the Q-process. But by Theorem 3 such a process is unique, thus the processes have the same 
distribution. □ 



26 



Second proof is based on computing the transition function for the corange-box area chain. 
Denoting temporarily the transition probabihty Q we can write 

^ ^ \ --^ 1 /■min{s/t,l) 

(42) dtQ{t,]0,s]) = -Q{t,]0,s]) + - Q{tx,]0,s])dx' + - dx' . 

2 Jo 2 Jo 

The first integral term stands for the event that the first atom to observe is a lower record, in 
which case there is no transition from t to ]0, s] and the new corange is tx. The second term 
stands for the event that the first observation is an upper record, and the decrement is larger 
than t — s in case t > s or arbitrary in case t < s. 

To transform P^ ) change the variable of integration to ^ = - this yields factor at the 
integral - then multiply equation by t^, differentiate and divide by t. The integral goes and 
we see that Q(t, ]0,s]) satisfies (P3|), same equation as for Q(t, ]0,s]). Both functions coincide 
with 1 — go(^) for t < s and there is no break at t = s, thus by uniqueness 

Q(t, ]o,s]) = g(t, ]o,s]). 

It follows that the corange-box area process in the HC-problem is identical, stochastically, with 
the Q-process of genuine box areas from VC-problem. 

4.4 Hor-Ver choice. A randomised model enables to couple VC- and HC-problems and 
to introduce some symmetry. Suppose a square is partitioned by uniform random horizontal 
and vertical cuts H and V which meet at point O. Two observers Ver and Hor learn the PPP 
configuration in the square as the same vertical detector moves from the left to the right. Each 
of the observers can drop out each time an atom is detected and the stop is a win if the last 
detected atom is the highest among the PPP atoms in the square south-west from O. Hor 
knows the position of V but not H; each time an atom is detected she is told if the atom is 
above H or below. Ver knows the position of H but not V. 

Call an atom a 'upper record' if a is the highest among all the PPP atoms below H seen so 
far. Both Hor and Ver hunt for the last upper record in the rectangle with vertex O. Call an 
atom a 'lower record' if a is the lowest among all PPP atoms above H seen so far. 

The appeal of this model is that the observers learn the same configuration and have the 
same objective. The surprise is that they perform equally well by using very different policies, 
optimal for different kinds of information flows. Clearly, the PPP configuration to the right 
from V is of no interest for Hor, who will stop at the first upper record a which has the corange 
area less than tp. Similarly, the configuration above H will be ignored by Ver, who will stop 
at the first upper record a with the area of 2-dim interval (a, O) less than tp. 

Generically, they stop at different atoms, but both succeed with same probability 

1 /■* 

- / qi{s,tp)ds 
t Jo 

which is close to fp when the side of the square t^/^ is sufficiently large. 

4.5 EU-representation. A model of infinite record processes leads to a EU-representation 
of the corange-box area chain, and offers a framework for asymptotic condiderations in the HC- 
problem. The role of these considerations is somewhat limited by the fact that there is no 
obvious infinite analogue of the stopping problem nor embedding of finite-t record processes. 
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Consider PPP in the infinite strip R — Rj^\J R_ with R = [0, 1] x [—00, 0] and i?+ = 
[0, 1] X [0, 00]. Define an atom to be a lower record if a G R+ and is lower than all atoms in 
i?+ to the left from a. Define an atom a e R- to be an upper record if a e R- and is higher 
than all atoms in R_ to the left from a. The definition agrees with that of Section 4.1 when 
the horizontal axis is understood as a fixed cut. 

Enumerate the upper records aj, from the right to the left (in reverse observation order). 
A lower record bj is called adjoint to aj if bj is the rightmost lower record to the left from aj. 
Note that in the infinite setting the adjoint record is defined with probability one. The corange 
at ttj is the vertical distance between aj and bj. 

Sequence (bj) has repetitions. The stick-breaking interval partition of [0, 1], which is induced 
by horizontal projection of (%), has intervals containing at most one point of the projected 
sequence (bj), and each time a partition interval is empty we have a repetition. 

A joint EU-representation for (aj), (bj) is complicated and we will not attempt describing 
it. But there is a representation for the corange boxes very much similar to what we had in 
Sections 2 and 3. Let Cj be the corange-box area at aj. 

Theorem 7 Sequence {Ck) can be jointly represented as 




Proving marginal representation is easy. Indeed, for j fixed a EU-representation for coordi- 
nates of a,- is 



On the other hand, given Ui = Ui, i < j the ordinate of the adjoint lower record bj is condition- 
ally independent of the ordinates of ,i < j — I, and is distributed like Ej^i{ui ■ ■ ■ Uj)^^. This 
yields the representation of corange at aj. Justifying the joint distribution is more involved, 
requiring some preparation. 

Lemma 4 Let Ei,E2 be i.i.d. exponential r.v.'s, independent of uniform V. Then 



C/i • • • Uj and - \Ei + ...+ 





El E2 



E 



Ui U2 



where Ui,U2 G [0, 1] and E is a standard exponential r.v. 



Proof. Expanding the nth power of the LHS yields an expression 




which has expectation 
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equal to the nth moment of the RHS. Since the moments characterise the exponential distri- 
bution uniquely we are done. □ 

Proof of the theorem. Consider 

a coordinate- wise EU-representation for upper records. Given (Uj) = {uj) we will construct a 
distributional copy of the corange sequence. To this end, we need a further supply of indepen- 
dent exponential and uniform r.v.'s {Gj) and (Vj), also independent of (Fj). 

We have ai = [ui , —Fi) and the adjoint lower record can be written as hi = {uiVi , GiUi^) 
so that 

is the smallest corange-box area. Note that the first component of bi is (conditionally) inde- 
pendent of oi and Ci. If ViUi < U1U2 then bi = 62 and if ViUi > U1U2 there in an increment 
and 62 = GiUi^ + G2{uiU2)~^- Continuing so forth, given ai, 61, ... , %, bj and given Ci, . . . , C, 



the horizontal position of bj+i is distributed like VjUi ■ ■ - uj and we have a repetion bj+i — bj 
exactly when VjUi ■ ■ - uj < Ui - ■ ■ mj+i. 

Each time Vj > Uj+i both a^+i and bj+i contribute to the corange increment (or decrement 
when viewed in the right observation order). Thus we arrive at a representation which should 
be clear from the j — 3 case: 

/ F2 F3 G\ G2 G3 \ , , 

C^3 = [Fi^ \ \ \ l{Vi>«2} H ^{V2>uz) (1 - U1U2U3). 

The terms 

\ 1{V2>«3} 

U1U2 U1U2U3 

are present neither in Ci nor in C2 thus we can painlessly replace them by £'3(^1^2 'tis) 
without destroying the joint distribution of (Ci, C2, C3). The next substitution 

F2 G2 E2 

\ -'-{Vl>U2} — 

Ui U1U2 U\U2 

should be performed simultaneously in G2 and C3. A complete proof follows by induction in 

Note that the representation does not show the cumulative contribution of upper records 
versus cumulative contribution of lower records. The theorem implies a distributional identity. 

Corollary. {Bj) = (Q). 

Example. The simplest instance of the distributional identity is i?i = Ci, which is 

^ (1 - C/,C/2) ^ + ^) (1-C/i). 
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The reader is advised to visually compare the EU-representations for {Aj), (Bj) and [Cj) 
and to attempt deducing Bj = Cj for j = 1, 2 by integration (see |2^). 

5 Extensions and compliments. 

5.1 Duration problem. Consider the PPP in i? = [0, t] x [0, 1], with horizontal axis interpreted 
as time scale. Suppose that stopping at a record at time s yields a reward equal to the horizontal 
distance between the record selected and the next record to observe, or equal to t — s if no 
record follows. This is the 'full-information case of the duration problem' introduced in by 
Ferguson et al [0, p. 55. It was shown in [11| that the optimal rule is ntp and recently, in 



fixed-n context, the value is asymptotic to tvp, see pT |. 

It is the aim of this section to show that the duration problem is nothing else but a minor 
variation of the VC-problem, namely its vualised version II. 

Suppose the first atom to observe is in the origin a = (0,0). The expected reward from 
stopping is then 



/ e ""xdx + te * = 1 - e * = tgo(^) 
Jo 



10 

where the second term in the LHS stands for the event that no further records occur. Similarly, 
stopping at atom a at time s yields a reward (t — s)qo{a{a)), where the box area is given by 
a(a) = {1 — x)(t — s) for a = (s, x). 

Now recall that in version II of the VC-problem the observer does not know if the horizon 
has been approached. Thus when an atom a is detected the conditional probability of best 
choice is equal to 

—^qo{a{a)) 

where the first factor is the chance that a is to the left from V and the second factor is the 
conditional probability of best choice given that a is indeed to the left from V. Thus the payoff 
in version II differs by constant factor from that in the duration problem. But version II is 
equivalent to version I, therefore in the duration problem the expected reward with tt^ is simply 
tqi(t, s), the optimal policy is ntp and the 'maximum expected duration of holding a record' is 
tu{t) = qi{t,tp). For any t. 

5.2 Bin-packing. Suppose there is a bin of unit capacity. To-be-packed items of random 
uniform- [0, 1] size arrive at the epochs of a homogeneous Poisson process. An item is irrevocably 
packed immediately at the time of arrival provided there is enough room in the bin left (greedy 
policy). The problem is to recognise the last packing at the time it occurs. 

A minute thought shows that the state variable in the problem is the product of the remain- 
ing capacity and the expected number of Poisson epochs to come. The probability law of this 
process is stochastically equivalent to the box-area process. And this implies that the problem 
is equivalent to the FI best-choice problem. 

5.3 Additive representations of best-choice probability. Of some interest are repre- 
sentations in the form of a sum of probabilities of events expressed explicitly via PPP config- 
uration. Decompositions of this kind are tractable logically, but not analytically because they 
cannot be expressed in invariant terms, i.e. using box areas. 
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Samuels developed such decompositions for FI-, VC- and HC-problems |2^. In the FI case 
his representation of 



Pi(oo, s) = P{Ai < s <A2) = 
P(^i(l -Ui)<s< (El + E2/f/i)(l - f/if/2)) 

is based on testing the inequality E2/U1 > — Ui) — Ei and has two parts 

P(Ei(l-f/i)<s<(Ei + E2/f/i)(l-f/i)) = (e^-l)/(s) 
P((Ei + E2/f/i)(l-f/i)<s<(^i + E2/f/i)(l-f/if/2)) = {e"' -sI{s))J{s). 

Loosely speaking, Samuels' decomposition makes distinction between the cases when the vertical 
distance between the last and second last records is large or small. 

Another decomposition appears when we concentrate on both the highest atom a and an 
atom h which is the highest among PPP atoms below — s, within the R = [0,1] x [— oo,0]. 
Indeed, suppose the event Ai < s < A2 occurs. There are three cases: a{b) > s, or a{b) < s 
and 6 is a record, or a{b) < s and b is no record. Let ^ < — s be the vertical position of b. 

In the first case the horizontal position of b must be within [1 — s/^, 1], and we must have 
A2 > Oi{b) and a as the unique record above —s. Integrating yields 

e^-«(l - s/Opi{s) d^ = (e-^ - sI{s))J{s) 

In the second case b coincides with a. In the third case there must be exactly one record 
above —s to the left from b and no atoms above —s to the right from b. We failed to eval- 
uate probabilities in the two last cases directly, as it involves the not-so-easy integration of 
x~^exp{—x + c/x) (an instance of generalised incomplete gamma function studied in p]]). 
Thus we could deduce the total probability of these cases, (e^ — l)/(s), only from the formula 
for pi{oo, s) and the first case. 

The moral of this is that the second decomposition yields the same two terms as that of 
Samuels, although it is based on a completely different principle. This offers a new puzzle 
because distribution of b does not fit in the EU-representation for records since b need not be 
a record at all. 

5.4 Beyond the box areas. The box-areas approach is good for study 'time-space invari- 
ant' functionals of the PPP records, but is of limited value when we need to explicitly separate 
the coordinates. Nevertheless, the invariance helps to study more general functionals as well. 
Next examples illustrate the matters in the context of FI problem. 

Example: distribution of stopping time. Consider threshold policy vr^ in i? = [0, 1] x 
[—00, 0]. Being a stopping time, TCg accepts some value within [0, 1[ - coordinate of the selected 
atom - or is indefinite if no atom is selected. Let f{t,C,, s) be the probability that the selected 
atom is above —t and to the left from ^ for t G [0, 00], ^ G [0, 1[ . For t < s we have f{t, ^, s) = 
1 — e~*^ because tt^ stops if there is such an atom. For t>s/(l— ,^)we have dt f{t, ^, s) = 0, 
as is easily seen by drawing a hyperbolic stopping boundary for vr^. And for t G [s, s/ (1 — ,^)] 
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because the choices in two close rectangles of heights t — 5 and t are only different when the 
atom highest for the configuration on [0, ^] x [—00, 0] is in the (5-strip. Integrating we find that 
for all t > s/{l - 




independently of t, it is therefore the distribution for semi-finite R. When ^ ^ 1, we have 
/(oo, ^,s) 1 — + s I{s) which is 1 — po{oo, s), probability that vr^ ever selects an atom. 



In Section 2.4 we derived an integral representation ([2^ ) of the best-choice probability in 
terms of the box-area process. Next example gives similar 'real-time' rate, a Poisson analogue 



of the 'probability of win at a given draw' introduced in |jT2|, p. 57. 



Example: the best-choice probability rate. In the framework of the previous example, 
let g(t,^,s) be the probability that the last record appears before ^ and is selected by tt^, 
^ G [0, 1]. Think of d^g{oo,C,, s) as a winning probability rate at time ^ G [0, 1], so that the 
total best-choice probability pi(oo, s) is obtained by integration over ^ G [0, 1]. 

It is not hard to see that digit, s) = for t > s/{l — ^), because the atoms south-west 
from the point (,^, —t) are outside the stopping region {(x, —t) : (1 — x)t < s}. It follows that 
9(00, ^,s) = g{s/{l-^),^,s). 

For t < s/{l — ^) we will find the derivative dtg{^, s) by the coupling technique. Consider 
two rectangles Ri = [0,^] x [— t, 0] and R2 = [0,C,] x [—{t — 6),0]. Policy vr^ stops at distinct 
atoms in Ri and R2 if the first record, say a, with box area less than s appears in the strip 
[(1 — ,^] X [—[t — 6) ,t]. Let X be the horizontal coordinate of a. When a is the overall 
last record iis wins in Ri but not in R2. The counterpart is more complex: vr^ wins in R2 but 
not in Ri if after a there are k > atoms in [x, 1] x [—t, 0], the leftmost of these atoms appears 
within [x,^] x [— t, 0] and it is the last record; the probability of this event is computed via 
distribution of the minimum in a sample of size k and using the fact that vertical ranking is 
independent of the arrival time. Integrating over x yields 

Differentiating in ^ and then integrating over t from to s/(l — ^) and finally converting the 
series into exponential integral functions we obtain a formula missed in the fundamental 1966 
paper: 

..(^.., ^ -e-^ . ^ . - ^ (. - . 

- complicated but correct! 

The grouping of terms was selected to show that the rate is an entire function in ^. For 
,^ = and 1 the values are 1 — e* and e"'', respectively, in accord with Figure 3 from |T^], 
corresponding to the optimal threshold s = tp. 

Remark. At the end of Section 3d, Gilbert and Mosteller write: "Theory we do not give 
shows that, for large n, the probability of winning on any draw with the optimum strategy ... 
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is roughly (1 — e~'^)/n ..." (with c = tp = 0.804- ■ ■). Now we know that this roughly means, 
in spirit of their one-paragraph Section 3e, precisely that up to higher order terms, probability 
of win at draw i is d^g{oo, i/n, tp) where the function is close to 0.6 for most of the range 
^ G [0, 1]. A Mathematica-diawn graph of d^g{oo, ■ ,tp) demonstrates perfect agreement with 
Figure 3 in [|g, p. 58. 
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